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Abstract. Let M.(x) denote the ring of polynomials in g freely non-commuting 
variables x = (x\, . . . , x g ). There is a natural involution * on M.(x) determined 
to/} by x* = Xj and (pq)* = q*p* and a free polynomial p 6 M(x) is symmetric 

if it is invariant under this involution. If X = [Xi, . . . , X g ) is a g tuple of 
symmetric n X n matrices, then the evaluation p(X) is naturally defined and 
further p*(X) = p(X)* . In particular, if p is symmetric, then p(X)* = p(X). 
The main result of this article says if p is symmetric, p(0) = and for each n 
and each symmetric positive definite nxn matrix A the set {X : A— p(X) y 0} 
is convex, then p has degree at most two and is itself convex, or — p is a 
hcrmitian sum of squares. 



1. Introduction 

Let WL{x) denote the ring of polynomials over R in the freely non-commuting 
variables x = (x\, . . . , x g ). Ape M.(x) is a free polynomial and is a finite sum 



P 



over words w in x with coefficients p w £ K. The empty word, which plays the role 
of the multiplicative identity, will be denoted 0. 
For a word 

(1.1) w = x jl x h ---x jk , 

let 

w = x jk ' ' ' x h x ji ■ 
The operation * extends naturally to an involution on R(x) by 

p = 2_^P wW ■ 

Let S„(]R 9 ) denote the set of g-tuples X = (Xi, . . . , X g ) of n x n symmetric 
matrices. For a word w as in (|l.ip . substituting Xj for Xj gives, 

X w =w(X)=X jl X j2 .--X jk . 

This evaluation extends to in the obvious way, 

Observe, for 6 § n (R s ) that p(0) — pain, where /„ is the nxn identity. 

The transpose operation * on matrices is compatible with the involution * on 
R(x) in that 

p(X)*=p*(X). 
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A polynomial p G R(x) is symmetric if p — p* and in this case p(X)* = p*(X) = 
p(X) so that p takes symmetric values. 

Let S„(R) denote the collection of symmetric n x n matrices. Given S G § n (R) 
the notations S y and S y indicate that S is positive definite and positive 
semidefinite respectively. A symmetric p G R(cc) is quasi-convex if p(0) = and for 
each n and positive definite matrix n x n matrix A the set 

V(A) = {X G §„(R 9 ) : A - p(X) y 0} 

is convex. 

A symmetric polynomial p is a (hermitian) sum of squares if there exists an m 
and hi, ... , h m G R(a;) such that 

Evidently such a p is positive in the sense that for each n and X G §„(R S ), 

p(x) y o. 

The following theorem is the main result of this article. 

Theorem 1.1. If p is quasi-convex, then either —p is a sum of squares, or there 
exists a linear polynomial I G R(x) and finitely many linear polynomials Sj G R(a;} 
such that 

(1.2) p(x)^£(x)+Y / s*(x)s J (x). 

Thus, p is a hermitian sum of squares of linear polynomials plus a linear term. 

Further, if there is an N such that for each n > N there is a B € §„ (R) such 
that B )t and {X G §„(R 9 ) : B — p(X) y 0} is convex, then —p is a sum of 
squares if and only if p = 0. 

Remark 1.2. It is easy to see that if p is a hermitian sum of squares of linear 
polynomials plus a linear term, then p is quasi-convex. 

1.1. Related results and remarks. Theorem 11.11 falls within the emerging fields 
of free analysis and free semialgebraic geometry. Free semialgebraic geometry is, 
by analogy to the commutative case, the study of free polynomial inequalities. For 
instance there are now a number of free Positivstellensatze for which [5] [TS] [IB] 
[TT] are just a few references. In this regard, see also |H] and its Proposition 
17. There is also a theory of free rational functions. Recent developments in 
this direction have been related to non-commutative multi-variate systems theory. 
See for instance pQ. Free rational functions actually appeared much earlier in the 
context of finite automata. See for instance [20] . Issues of convexity in the context of 
free polynomials and rational functions naturally arise in systems theory problems 
[HI [14] and mathematically are related to the theory of operator spaces and systems 
and matrix convexity [6] . More generally, there is a theory of free analytic functions 
which arise naturally in several contexts, including free probability. A sampling of 
references includes [53] [T3J [H] and [15] , 

Some systems theory problems present as a free polynomial inequality (or more 
realistically as a system of matrix-valued free polynomial inequalities) involving 
two classes of free (freely non-commuting) variables, say the a variables and the x 
variables. The a variables are thought of as known (system or state) parameters 
and the x variables unknowns. For a given free polynomial q, of interest is the case 
that, for each fixed A in some distinguished collection of known parameters, the 
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inequality q(A, x) >~ is convex in x. Thus this article considers the simplest such 
case. Namely there is just one a variable and q(a, x) = a— p(x) for a polynomial p in 
the variables x alone. For comparison, a main result of [5] says, generally, if q(A, x) 
is convex in x for each fixed A, then q(a, x) — L{a, x) + Y] hj(a, x)*hj(a, x) where 
L has degree at most one in x and each hj is linear in x. The articles [3l SI [5] [9] 
contain results for polynomials / whose positivity set - namely the set of those X 
such that f(X) >- - is convex. 

A symmetric polynomial p is matrix convex if for each n, each pair X, Y 6 
S„(R ff ), and each < t < 1, 

p(tx + (i - t)Y) < tp{x) + (i - t)p(r). 

The following Theorem, pointed out by the referee, generalizes the main result of 

ma- 

Theorem 1.3. For a symmetric polynomial p the following are equivalent. 

(i) p — p(0) has the form in Equation (jTTSJ) ; 
(ii) p is matrix convex; 
(Hi) T>{A) is convex for every A £ S n (R 9 ); 

(iv) p(x) — p(0) is quasi-convex and p(0) — p(x) is not a nonzero sum of squares. 

A remark is in order about the definition of quasi-convex used here. A classical 
definition says that a function / of several (commuting) variables is quasi-convex 
if each of its sub-level sets is convex. (The interested reader can work out the rela- 
tionships between this definition of quasi-convex and the seemingly more popular 
one f(tx + (1 — t)y) < max{/(i), f(y)} ioi < t < 1.) In considering a free analog 
here, because of the role that positivity plays in the arguments, it was convenient 
to make the harmless normalization that p(0) = and then only require convexity 
of the (level) sets T>(A) for A positive definite in the definition of quasi-convex. 

1.2. Reader's guide. The remainder of this paper is organized as follows. Issues 
surrounding sums of squares are dealt with in Section[2] Just as in the commutative 
case, convexity is related to positivity of a Hessian. The necessary definitions and 
basic results appear in Section [3l Section |4] examines membership in the boundary 
of the set T>(A) as well as consequences of the convexity hypothesis. Theorem ll.il 
is proved in the final section, Section [SJ 

We thank the referee for many corrections, suggestions and references related to 
this article. 

2. The Sum of Squares Case 

The following proposition dispenses with the alternative that — p is a sum of 
squares. 

Proposition 2.1. If there is an N such that for each n > N there exists a B G 
§„(K) such that B ^ and the set T>(B) is convex, then —p is not a nonzero sum 
of squares. 

Proof. Arguing by contradiction, suppose — p is a sum of squares. Consider the 
polynomial 

2d 

-p(tx) = J2Pj( x ) t3 ' 
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where the pj are homogeneous of degree j polynomials in the free variables x. 
Since — p is a sum of squares, —p(tx) has even degree as a polynomial in t, and 
so, without loss of generality, we may assume that p2d is nonzero. Now P2d is 
itself a sum of squares and hence it takes positive semi-definite values. If, on the 
other hand, p2d(X) is never positive definite, then det(p2d(X)) = for all n and 
X G §„(R 9 ). An application of the Guralnick-Small lemma as found in [8] then 
gives the contradiction that p2d is the zero polynomial. Thus there is an n and an 
X G §„(R 9 ) such that p2d(X) y 0. By assumption, there is a B G § n (M) such that 
B )£_ and T>(B) is convex. Choosing t sufficiently large, it may be assumed that 
both B —p{tX) and B — p{—tX) are positive definite. In this case tX and — tX are 
in T>(B) and thus = \{tX + (~tX)) G £>(£?), contradicting the assumption that 
B is not positive semidefinite and completing the proof. □ 

2.1. When — p is not a sum of squares. Given n, let JC{n) denote the set of 
those X G §„(R 9 ) such that p{X) has a positive eigenvalue. The /C(n) are open 
sets, the issue is whether they are empty or not. In the free setting, and unlike in 
the case of several commuting variables, positive polynomials are sums of squares 
with [ST] [35] and [7J as a very small sampling of the references. (For a reference 
which explicitly treats the case of the symmetric variables (x* — Xj) used here see 
[IT].) In particular, each IC(n) is empty if and only if — p is a sum of squares. 

The conclusion of the following lemma will be used later in the proof of Theorem 
11.11 when — p is not a sum of squares. 

Lemma 2.2. Suppose q G and —p is not a sum of squares. If det(q(X)) = 

for every n and X G /C(n) ; then q = 0. 

Before beginning the proof of Lemma |2~21 we record the following simple fact. 

Lemma 2.3. Suppose q G R(ir) and m is a positive integer. If K{m) is nonempty 
and det(q(X)) = for all X G K{m), then det(q(X)) = for all X G § m (K 9 ). 

Proof. The function § m (IR 9 ) 3 X i-> dct(q(X)) is a polynomial in the entries of X. 
Hence, if it vanishes on an open set it must be identically zero. □ 

Given integers fc, I. Let m = k + I and consider the subspace 



of § m (M 9 ). Each tuple X G S is a direct sum X = (Yi © Z u . . . , Y g © Z g ) where 
Y = (Yi, . . . , Y g ) G S k {R 9 ) and Z = (Z u . . . , Z g ) G ^t{M. 9 ) and where 



Proof of Lemma \2. 6 A Since —p is not a sum of squares, by the remarks at the 
outset of this section, there is an m and a Y G /C(m) such that p(Y) has a positive 
eigenvalue. First observe that for any positive integer k and X G Sfe(R 9 ) that 
p(X © Y) = p(X) (BpiY) too has a positive eigenvalue. Thus, K,(n) is nonempty 
for each n > m. Lemma 12.31 now implies that det(q(X)) = for each n > m and 



X G §„(M 9 ). 

Now suppose 1 < I < m. Since n = mi > m, det[q[X)) = for all X G S„(M 9 ). 
Given I6§,(1»), the tuple 




X = ®?X G § n (R 9 ). 
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Thus (det(g(X))) m = det(q(X)) = 0. Hence det(q(X)) = for every X € S<?(R 9 ). 

Since det(g(X)) = for every n and X £ S n (R s ), the Guralnick-Small lemma in 
[5] implies q is the zero polynomial. □ 

3. Directional Derivatives and the Hessian 

Given p £ R(x) another set of freely non-commuting variables h = (hi, . . . , h g ) 
and the real parameter t, 

p(x + th) = ^p j (x)[h]t j , 

where pj(x)[h] are polynomials in the variables (x,h) — (x\, . . . , x g , hi, . . . , h g ) 
(which are of course freely non-commuting). The notation indicates the different 
role that these variables play. Indeed, observe that pj(x)[h] is homogeneous of 
degree j in h. 

The polynomial p\ (x) [h] is the directional derivative or simply the derivative of 
p (in the direction h) and is denoted p'(x)[h\. The polynomial 2p 2 (x)[h] is the 
Hessian of p and is denoted by p"(x)[h\. 

Given X € S n (R 9 ) and net", let 

T(X,v) = {H 6 §„(R 9 ) : p'(X)[H]v = 0} C S n (M 9 ). 

In the case that (A — p(X))v = 0, the subspace T(X,v) is the clamped tangent 
plane to 2?(A) at (X, v) [3]. In this case, if one chooses H e T(X, v), then 

((A-p(X + tH))v,v) = -^t 2 {p"(X)[H]v,v)+t 3 e(t), 

for some polynomial e(t). This identity, much as in the commutative case, provides 
a link between convexity and positivity of the Hessian of p. 

4. The boundaries 

Fix p satisfying the hypothesis of Theorem ll.il In particular p(0) = 0. 

Lemma 4.1. Let n and a positive definite A 6 S„(R) be given. A given X G §„(M 9 ) 
is in the boundary ofT>(A) if and only if A — p(X) is positive semidefinite and has 
a non-trivial kernel. 

Proof. Suppose that X is in the boundary of T>(A). It follows that A — p(X) >: 0. 
It must be the case that A — p(X) has a non-trivial kernel, because otherwise, 
X e T> (A) which is an open set. 

To prove the converse, suppose A — p(X) is positive semidefinite and has a non- 
trivial kernel. Clearly, X £ V(A). For positive integers n, let A n = ( 2i ^-) A. Then 
A n — p(X) is positive definite. Hence X G T>(A n ) and by convexity of T>(A n ), for 
a fixed < s < 1, A n — p(sX) >- 0. Letting n tend to infinity, it follows that 
A-p(sX) t 0. 

Consider the function tp : R — > R defined by -0(<) = det(A — p(tX)). From what 
has already been proved, t/)(t) > for < t < 1. Since ip(t) is a polynomial in the 
variable t, either it vanishes everywhere on R or only on a finite subset of R. If 
ip(t) vanishes everywhere, then ?/>(0) = det(A) = which contradicts the positive 
definiteness of A. Thus ip(t) > except for finitely many point in (0, 1) and thus 
there is a sequence (s„) from (0, 1) such that each s n X S 'D(A) and s n X — > X. 
Hence X is in the boundary of D(A). □ 
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Suppose that X 6 §„(R 9 ) is in the boundary of V(A) and v ^ is a vector in 
R™ such that Av = p(X)v. 

Proposition 4.2. With X and v as above, if the dimension of the kernel of A—p{X) 
is one, then there exists a subspace W of l~(X,v) of codimension one (in T(X,v)) 
such that, for H e H, 

(4.1) (p"(X)[H]v,v)>0. 

Remark 4.3. Since T(X, v) has codimension at most n in §„(R 9 ), it turns out that 
T-L will have codimension at most n + 1 in §„(R 9 ). In fact, a slight modification of 
the proof below shows that there is a subspace /C of §„(R 9 ) of codimension at most 
n for which equation (|4.1[) holds. The key point is, with A as in the proof of the 
proposition, if A(H) = 0, then (p'(X)[H]v,v) = 0. 

Unlike a related argument in [3] , the proof here does not rely on choosing a curve 
lying in the boundary of a convex set, thus eliminating the need for a corresponding 
smoothness hypothesis. 

Proof. Since X is in the boundary of the convex set T>(A), there is a linear functional 
A : §„(R 9 ) -> R such that A(Z) < 1 for Z e V(A) and A(X) = 1. The subspace 

H = {H S T{X, v) : A(H) = 0} 

has codimension one in T(X,v). 

Fix H E n and define F : R -)• S„(R) by F(t) = A — p(X + tH). Thus, F is a 
matrix-valued polynomial in the real variable t. Let [v] denote the one dimensional 
subspace of R" spanned by the vector v. Write F(t), with respect to the orthogonal 
decomposition of R™ as [v] 1 - © [v], as 

F(f] _ fQ(t) 9(t)\ 

[) -\g(ty fit)) 1 

where Q is a square matrix-valued polynomial, g is a vector, and / is a scalar- 
valued polynomial. The assumption (A — p(X))v = implies that / and g vanish 
at 0. The further assumption that H e T(X, v) implies that / and g actually 
vanish to second order at 0. In particular, there are polynomials /3 and 7 such that 
g{t)=t 2 (i{t) and/(t)=t 2 7 (t). 
Observe that 

1 (0) = -(p"(X)[H]v,v). 
Thus, to complete the proof of the theorem it suffices to use the choice of A (and 
thus the convexity hypothesis on T>{A)) and the assumption on the dimension of 
the kernel of A —p(X) to show that 7(0) < 0. Indeed, since the kernel of A — p(X) 
has dimension one, it follows that Q(0) >- 0. Therefore, there exists an e > such 
that if \t\ < e, then Q(t) y 0. On the other hand, A{X + tH) = A(X) = 1 for all t. 
Thus X + tH V(A) which means F(t) = A - p(X + tH) ^ 0. Hence, the Schur 
complement of F is nonpositive; i.e., 

t 2 [ 7 (t)-t 2 p*(t)Q-\t)!3(t)}<0. 
It follows that, for \t\ < e, 

l{t)<t 2 p*{t)Q- 1 {t)[3{t) 
and hence 7 (0) < 0. □ 
We end this section with the following simple observation. 
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Lemma 4.4. Suppose X £ §n(R s ) and ^ v £ R™. J/ i/iere is a A > swc/i 
that p(X)v = Xv, then there exists a positive definite A £ §„(R) such that X is in 
the boundary ofD(A) and v spans the kernel of A —p(X). Hence, for the triple 
{A, X, v) the conclusion of Proposition \4-£\ holds. 

Further, if, for a given positive definite A £ S n (R), X is in the boundary of 
T>{A), and v is a nonzero vector such that (A — p{X))v = 0, then for each e > 
there is a A e > such that \\A — A e \\ < e, X is in the boundary ofV(A e ) and the 
kernel of (A e — p(X)) is spanned by v. 



Proof. With respect to the decomposition of R™ as [v] ® [v} ± , 

'X N 
T, 



for some symmetric matrix T. Choose fi > so that \x — T >~ and let 

'A N 



A 

In particular, 



jti 



A-p(X) 



'0 

is positive semidefinite with one dimensional kernel spanned by v. From Lemma 
|4~T1 X is in the boundary of T>(A). 

As for the further statement, diagonalize with respect to same orthogonal de- 
composition of l n as above 

'0 o x 



A-p(X) 



T 



for some positive semidefinite T. Let P denote the projection onto [v] and let 
A t =A + eP, then 

'0 
v e + T y 

and the result follows. □ 



Ae-p(X) 



5. Direct Sums and Linear Independence 

As in Section^ let K.(n) denote the set of those X S §„(R 9 ) such that p(X) has 
a positive eigenvalue. From here on, it is assumed that — p is not a sum of squares. 
Equivalently, JC{m) is not empty for some m. 

Let K,(n) denote the set of pairs (X, v) such that X £ JC(n) and v is an eigenvector 
of p(X) corresponding to a positive eigenvalue. By Lemma f4.4[ if (X,v) £ tC(n), 
then there exists a positive definite A £ § n (M) such that X is in the boundary of 
T>(A) and the kernel of A — p(X) is spanned by v. Let (x)k denote the set of words 
of length at most k. 

Lemma 5.1. Fix a positive integer k. Given X £ §„(R 9 ) and v £ R ra , there is a 
nonzero q £ M.(x) of degree at most k such that q(X)v = if and only if the set 
{w(X)v : w £ (x)k} is linearly dependent. 

If q £ K(x) and q(X)v — for all n and (X,v) £ /C(n), then q = 0. 

Proof. The first statement is evident. As for the second, the hypotheses imply that 
det(g(A)) — for each n and X £ K.(n). Hence by Lemma [2721 q = 0. □ 
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Lemma 5.2. Let d denote the degree of p. Given a positive integer N , there exists 
an 11 > N and a pair (X, v) with X £ §„(R 9 ) and s/0 in M n such that 

(i) there is a subspace % of T(X,v) of codimension at most one such that, for 
all H en, 

(p"(X)[H]v,v)>0; 
(ii) if q is of degree at most d — 1 such that q(X)v — 0, then q = 0. 

Proof. Let V denote the vector space of polynomials in g variables of degree at 
most d — 1. Given (Y, w) £ IC(n), let 

Q(Y,w) = {qeV : q(Y)w = 0}. 

Thus, Q(Y, w) is a subspace of the finite dimensional vector space V. Further, by 
Lemma 15.11 

n{Q(F, w) : (Y, w) e /C(n), n£N} = {0}. 

Because of finite dimensionality, there are positive integers t and n\,...,n t and 
(Y j , w j ) G /C(rij) such that 

n$ =1 Q(i"V') = {0}. 

In particular, if q G M.(x) has degree at most d— 1 and qiY^vji = for j = 1, . . . ,i, 
then q — 0. 

Let Z = ©y- 7 and z = ffiw 5 . Thus Z acts on a space of dimension n' ~ n j ■ 
Choose a positive integer k such that n — kn' > N and let X = ®\Z and v = ©^z. 
From the definition of IC(n) and by Lemma l4~4l for each j there is a positive definite 
A 3 e S nj (M) such that F- 7 is in the boundary of V(Aj) and (Aj -p{Y^))w^ = 0. Let 
B = ®A and A' = @\B. Then (A' -p(X))v = and A'-p(X) h 0. Moreover, if 
g has degree at most d — 1 and q(X)v = 0, then q = 0. 

Finally, choose a positive definite A e §„ (R) by the second part of Lemma 14.41 
such that X is in the boundary of T>(A) and the kernel of (A — p(X)) is spanned 
by v . In particular, X is in the boundary of T>(A). The triple (A, X, v) satisfies 
the hypotheses of Proposition 14.21 Hence there is a subspace H of T(X, v) of 
codimension at most one such that 

(p"(X)[H]v,v) > 

for all HeH. □ 
The symmetric polynomial 

r(x)[h] =p"{x)[h] 

in the 2g variables (x\, . . . , x g , h%, . . . , h g ) is homogeneous of degree two in h. It 
admits a representation of the form 

r(x)[h] - [Vo(x)[h] T ---V d ^ 2 (x)[h] T ] Z(x) 

where Z{x) is a (uniquely determined square symmetric) matrix of free polynomials 
and Vj (x) [h] is the vector with entries hgw over free words w of the the variables 
xi, . . . , x g of length j and 1 < I < g. (For details see [1].) The matrix Z = Z(Q) is 
the middle matrix. 



V (x)[h] 
V d - 2 (x)[h 
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Lemma 5.3. If Z is positive semidefinite, then p has degree at most two and 
moreover, p has the form in Equation (|1.2|) . 

A proof can be found in [10] . The idea is that the middle matrix Z has an 
antidiagonal structure which implies, if it is positive semidefinite, then its only 
nonzero entries correspond to Vo[/i], which is linear in h and independent of x. 
Thus, 

r{x)[h] = r[h] = V [h] T ZV [h] 

and it can be shown that Z must be positive semidefinite. Writing Z as a sum of 
squares and using 

p(x) = £(x) + -r(x)[x] 

expresses p in the form of Equation (|1.2[) . 

The following Lemma is a consequence of Lemma 7.2 from pQ. 

Lemma 5.4. There is an integer v depending only upon the degree d of the poly- 
nomial p and the number g of variables such that the following holds. If 

(i) n>v satisfies < 1; 
(ii) X G §„(R 9 ) andv e R"; and 

(Hi) there exists a subspace ri of T~(X,v) of codimension at most one such that for 

each H € ri Equation (|4.1[) holds; and 
(iv) there does not exist a nonzero polynomial q of degree at most d — 1 satisfying 

q{X)v = 0, 

then Z is positive semidefinite. 

To prove Theorem 11.11 simply observe that the existence of an n that satisfies 
the conditions (i) - (iv) of Lemma [5^41 is guaranteed by Lemma [572! The conclusion 
Z is positive semidefinite combined with Lemma 15.31 now completes the proof. 

It remains to prove Theorem 11.31 The equivalence of conditions (i) and (ii) is 
the main result of [ID] . That (ii) implies (iii) is easily checked. If (iii) holds, then, 
by definition, p(x) —p(0) is quasi-convex. Moreover, by the second part of Theorem 
II. U condition (iii) implies p(0) — p(x) is not a nonzero sum of squares. Hence (iii) 
implies (iv). If (iv) holds, then Theorem 11.11 implies p{x) — p(0) is a linear term 
plus a (hermitian) sum of squares of linear polynomials and thus (i) holds. 
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